ABSTRACT

The research described in this paper is a new approach to partial query processing based on a SAX query parser
and a query engine. The approach first loads the compressed XML document and then processes the query over it. A brief
description of the proposed scheme is presented, followed by its design. The paper then discusses the experimental
results, followed by their evaluation and comparison with some other existing technologies.
INTRODUCTION

Query processing plays a vital role in a database system. However, little work has been done to study
effective compression and query processing in XML documents. A query processor refers to a collective set of strategies
that process queries in a reasonable amount of time. The basic components of any query processor are the parser and the
query engine. The query engine attempts to resolve the query. The search engine and the parser are the heart of the
query processor. The search engine searches for an exact match for each incoming query and then optimizes it to
produce the result.
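
The parse-then-match flow described above can be illustrated with a minimal sketch. It is not the XVSGC implementation: it assumes Python's standard xml.sax module as the SAX parser, uses zlib (DEFLATE, an LZ77-based format) as a stand-in for the paper's compressor, and decompresses the whole document before parsing. The names TagCollector and run_query, and the sample document, are hypothetical.

```python
import xml.sax
import zlib


class TagCollector(xml.sax.ContentHandler):
    """SAX handler that collects the text of every element named `target`."""

    def __init__(self, target):
        super().__init__()
        self.target = target
        self.depth = 0      # greater than 0 while inside a matching element
        self.buffer = []    # text fragments of the current match
        self.matches = []   # completed matches, in document order

    def startElement(self, name, attrs):
        if name == self.target:
            self.depth += 1
            if self.depth == 1:
                self.buffer = []

    def characters(self, content):
        if self.depth > 0:
            self.buffer.append(content)

    def endElement(self, name):
        if name == self.target and self.depth > 0:
            self.depth -= 1
            if self.depth == 0:
                self.matches.append("".join(self.buffer).strip())


def run_query(compressed_xml, target):
    """Decompress the document, stream it through SAX, return matching text."""
    xml_bytes = zlib.decompress(compressed_xml)
    handler = TagCollector(target)
    xml.sax.parseString(xml_bytes, handler)
    return handler.matches


# Hypothetical sample document, compressed with zlib for the demonstration.
sample = b"<catalog><item><title>Lamp</title></item><item><title>Desk</title></item></catalog>"
print(run_query(zlib.compress(sample), "title"))  # ['Lamp', 'Desk']
```

Because SAX streams events instead of building a tree, the handler keeps only the current match in memory, which is the property that makes a SAX parser attractive for querying large documents.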

The compression time for an XML database is calculated by running the compressor on the same database multiple
times and taking the average as the result. The main reason for doing this is to reduce the influence of disk I/O on
the results by loading the whole document into physical memory. CR1 and CR2 are calculated using the following
formulas.

The compression ratio is defined as follows:

\[ CR_1 = \frac{\text{Size of Compressed File} \times 8}{\text{Size of Original File}} \]

\[ CR_2 = \left(1 - \frac{\text{Size of Compressed File}}{\text{Size of Original File}}\right) \times 100 \]
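
As a small worked example of the two ratios, the sketch below computes CR1 and CR2 from file sizes. It assumes CR2 is read as (1 - compressed/original) x 100, that is, the percentage of space saved; the function name compression_ratios and the sizes used are illustrative and not taken from the experiments.

```python
def compression_ratios(original_size, compressed_size):
    """Return (CR1, CR2) for the given file sizes (same unit for both)."""
    if original_size <= 0:
        raise ValueError("original_size must be positive")
    fraction = compressed_size / original_size
    cr1 = fraction * 8          # compressed bits per original byte
    cr2 = (1 - fraction) * 100  # percentage of space saved
    return cr1, cr2


# Illustrative sizes only: a 1000-byte file compressed to 250 bytes.
cr1, cr2 = compression_ratios(1000, 250)
print(cr1, cr2)  # 2.0 75.0
```

A smaller CR1 and a larger CR2 both indicate better compression; a CR2 below zero means the compressed file is larger than the original.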

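Compression Ratio Factor (CRF): normalizes the compression ratio of XVSGC with respect to XMill and XGRIND.

\[ CRF_1 = \frac{CR_{XVSGC}}{CR_{XMill}} \]

\[ CRF_2 = \frac{CR_{XVSGC}}{CR_{XGRIND}} \]

Compression Time Factor (CRT): normalizes the compression time of XVSGC with respect to XMill and XGRIND,
where CT denotes the compression time of the named compressor.

\[ CRT_1 = \frac{CT_{XVSGC}}{CT_{XMill}} \]

\[ CRT_2 = \frac{CT_{XVSGC}}{CT_{XGRIND}} \]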
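
Both factors are the same operation, a division of the XVSGC measurement by the corresponding baseline measurement. A minimal sketch, assuming that reading of the definitions; the function name normalize and the numbers are hypothetical and are not measurements from this paper.

```python
def normalize(xvsgc_value, baselines):
    """Divide the XVSGC measurement by each baseline compressor's measurement."""
    factors = {}
    for name, value in baselines.items():
        if value <= 0:
            raise ValueError(f"baseline {name!r} must be positive")
        factors[name] = xvsgc_value / value
    return factors


# Hypothetical compression ratios, for illustration only.
print(normalize(3.0, {"XMill": 2.0, "XGRIND": 4.0}))
# {'XMill': 1.5, 'XGRIND': 0.75}
```

The same function applies to compression times: pass the XVSGC time and the baseline times instead of the ratios.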
EXPERIMENTAL EVALUATION

The proposed scheme, comprising the search engine, the parser, and LZ-based compressed query processing, was
studied under different loads. The performance metric used to measure the performance of the query processor is the
minimum time taken to execute a query and return the result.

Query Response Time (QRT): the time T required to execute the query.

The results of the evaluation for the different documents are shown in the following tables, together with
comparative graphs.

Each query is fired more than five times for the same purpose. The tables show five queries and three
different timings (DT1 to DT3) for partial processing. Finally, we report the average of those timings as our result.
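
The averaging procedure can be sketched as follows. This is an assumption-laden illustration, not the authors' measurement harness: it uses Python's time.perf_counter as the clock, and the helper names mean_time and measure_query are hypothetical. The sample timings are the Weblog Q1 values reported in Table 1.

```python
import time


def mean_time(timings):
    """Average a list of response times in seconds."""
    if not timings:
        raise ValueError("at least one timing is required")
    return sum(timings) / len(timings)


def measure_query(run_query, repetitions=3):
    """Run `run_query` repeatedly and return (average, individual timings)."""
    timings = []
    for _ in range(repetitions):
        start = time.perf_counter()
        run_query()
        timings.append(time.perf_counter() - start)
    return mean_time(timings), timings


# Weblog Q1 timings from Table 1 (DT1, DT2, DT3).
print(round(mean_time([0.275, 0.212, 0.246]), 9))  # 0.244333333
```

Repeating each query and averaging reduces the effect of one-off delays, such as the first run paying for disk reads that later runs find in memory.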
Figure 1: Query Response Time of XVSGC for 321gone

Figure 2: Query Response Time of XVSGC for Ebay

Figure 2 above shows the graphical representation of the query response time of XVSGC for Ebay. The graph is
drawn using our experimental results.

Figure 3: Query Response Time of XVSGC for DBLP (Publications)



Impact Factor (JCC): 6.8785 



Index Copernicus Value (ICV): 3.0 
Figure 3 above shows the graphical representation of the query response time of XVSGC for the Publication document.

Figure 4: Query Response Time of XVSGC for Yahoo

Figure 4 above shows the graphical representation of the query response time of XVSGC for Yahoo. The graph
is drawn using our experimental results.

Figure 5: Query Response Time of XVSGC for Shakespeare

Figure 5 above shows the graphical representation of the query response time of XVSGC for Shakespeare.
The graph is drawn using our experimental results.

Figure 6: Query Response Time of XVSGC for Treebank

Figure 6 above shows the graphical representation of the query response time of XVSGC for Treebank.
The graph is drawn using our experimental results.

Figure 7: Query Response Time of XVSGC for SwissProt

Figure 7 above shows the graphical representation of the query response time of XVSGC for SwissProt.
The graph is drawn using our experimental results.

Table 1: Performance over Regular Documents

Name          Q. No   DT1 (in Sec)   DT2 (in Sec)   DT3 (in Sec)   AVG (XVSGC) (in Sec)
Weblog        Q1      0.275          0.212          0.246          0.244333333
              Q2      0.114          0.245          0.168          0.175666667
              Q3      0.12           0.133          0.146          0.133
              Q4      0.185          0.226          0.252          0.221
              Q5      0.275          0.288          0.27           0.277666667



Table 2: Partial Query Performance of XVSGC

Name          Orig. Size   Comp. Size   Q. No   DT1 (in Sec)   DT2 (in Sec)   DT3 (in Sec)   AVG (XVSGC) (in Sec)
321gone       24.9         22.5         Q1      0.131          0.7            0.3            0.377
                                        Q2      0.2            0.4            0.6            0.4
                                        Q3      0.19           0.5            0.14           0.276666667
                                        Q4      0.5            0.8            0.4            0.566666667
                                        Q5      0.6            0.21           0.8            0.536666667
Ebay          34.7         41.6         Q1      0.14           0.2            0.2            0.18
                                        Q2      0.9            0.35           0.7            0.625
                                        Q3      0.5            0.7            0.6            0.6
                                        Q4      0.13           0.7            0.5            0.443333333
                                        Q5      0.12           0.5            0.3            0.306666667



Table 2 above shows the query performance of XVSGC over irregular documents (query response times of XVSGC for
321gone and Ebay).
Figure 8: Query Performance of XVSGC Compared with Others on Textual Documents

Figure 8 above shows the query response time for the different compressors. From this figure we can conclude
that the query response time of XVSGC is better than that of XGRIND and more than that of Xqzip and Xqzip+, as it is a
non-queriable compressor.

CONCLUSIONS 

For this work, some XML compressors were examined, and at the same time some query evaluation techniques
were investigated, with the objective of processing XML queries efficiently.

The proposed work intends to achieve optimal, efficient, and fast query processing, together with efficient query
evaluation over compressed XML.

The query processor plays an important role in a database management system. After evaluation, we found that
our approach gives improved performance: the query response time of XVSGC is better than that of XGRIND and
comparable to that of Xqzip and Xqzip+.

In wireless devices such as PDAs, smartphones, and palmtops, non-queriable compressors are not advantageous,
since querying the data is important. Based on the results, XVSGC is the better choice, as it gives better results
than XGRIND.

Finally, it can be concluded that the proposed technique for compressing XML databases and evaluating queries
over them improves the performance of XML database systems.
APPENDICES

Query Response Time for Different XML Documents



Table 3

Name          Q. No   DT1 (in Sec)   DT2 (in Sec)   DT3 (in Sec)   AVG (XVSGC) (in Sec)
321gone       Q1      0.131          0.7            0.3            0.377
              Q2      0.2            0.4            0.6            0.4
              Q3      0.19           0.5            0.14           0.276666667
              Q4      0.5            0.8            0.4            0.566666667
              Q5      0.6            0.21           0.8            0.536666667
Ebay          Q1      0.14           0.2            0.2            0.18
              Q2      0.9            0.35           0.7            0.625
              Q3      0.5            0.7            0.6            0.6
              Q4      0.13           0.7            0.5            0.443333333
              Q5      0.12           0.5            0.3            0.306666667
Publication   Q1      0.45           0.01           0.01           0.156666667
              Q2      0.59           0.26           0.25           0.366666667
              Q3      0.1            0.1            0.1            0.1
              Q4      0.38           0.2            0.3            0.293333333
              Q5      0.8            0.5            0.3            0.533333333
Yahoo         Q1      0.24           0.12           0.12           0.16
              Q2      0.11           0.5            0.8            0.47
              Q3      0.7            0.7            0.4            0.6
              Q4      0.12           0.5            0.8            0.473333333
              Q5      0.6            0.9            0.9            0.8
Shakespeare   Q1      0.327          0.41           0.281          0.339333333
              Q2      0.487          0.295          0.396          0.392666667
              Q3      0.32           0.355          0.357          0.344
              Q4      0.361          0.326          0.361          0.349333333
              Q5      0.54           0.52           0.435          0.498333333
Treebank      Q1      0.674          0.6            0.66           0.644666667
              Q2      0.171          0.16           0.17           0.167
              Q3      1.003          1              1              1.001
              Q4      0.985          0.9            0.98           0.955
              Q5      0.453          0.45           0.4            0.434333333
SwissProt     Q1      0.814          0.814          0.81           0.812666667
              Q2      0.81           0.81           0.8            0.806666667
              Q3      0.71           0.8            0.71           0.74
              Q4      0.7            0.69           0.7            0.696666667
              Q5      0.7            0.71           0.71           0.706666667
Weblog        Q1      0.275          0.212          0.246          0.244333333
              Q2      0.114          0.245          0.168          0.175666667
              Q3      0.12           0.133          0.146          0.133
              Q4      0.185          0.226          0.252          0.221
              Q5      0.275          0.288          0.27           0.277666667
Mondial       Q1      1.611          1.317          1.349          1.425666667
              Q2      0.6            0.7            0.5            0.6
              Q3      0.28           0.2            0.12           0.2
              Q4      0.24           0.12           0.12           0.16
              Q5      0.06           0.09           0.11           0.086666667



The table above shows the query response times. In the experiment we used more than five queries to obtain a
better result.